Appendix 4 



Alignment of dengue virus polyproteins 



DEN4 1 MNQRKKVVRPPFNMLKRERNRVSTPQGLVKR... 49

DEN1-WP 1 MNNQRKKTGRPSFNMLKRARNRVSTVSQLAKRFSKGLLSGQGPMKLVMAF 50

DEN2-NGC 1 MNNQRKKARNTPFNMLKRERNRVSTVQQLTKRFSLGMLQGRGPLKLFMAL 50

DEN3-H87 1 MNNQRKKTGKPSINMLKRVRNRVSTGSQLAKRFSRGLLNGQGPMKLVMAF 50








***** ***** ****** * **** *. * # ** t m * 




DEN4 50 ITFLRVLSIPPTAGILKRWGQLKKNKAIKILIGFRKEIGRMLNILNGRKR

IAFLRFLAIPPTAGILARWGSFKKNGAIKVLRGFKKEISNMLNIMNRRKR

VAFLRFLTIPPTAGILKRWGTIKKSKAINVLRGFRKEIGRMLNILNRRRR

DEN3-H87 51 IAFLRFLAIPPTAGVLARWGTFKKSGAIKVLKGFKKEISNMLSIINKRKK 100
STITLLCLIPTVMAFSLSTRDGEPLMIVAKHERGRPLLFKTTEGINKCTL
SVTMLLMLLPTALAFHLTTRGGEPHMIVSKQERGKSLLFKTSAGVNMCTL
TAGMTTMLIPTVMAFHLTTRNGEPHMIVSRQEKGKSLLFKTEDGVNMCTL
TSLCLMMMLPATLAFHLTSRDGEPRMIVGKNERGKSLLFKTASGINMCTL
IAMDLGEMCEDTVTYKCPLLVNTEPEDIDCWCNLTSTWVMYGTCTQSGER
151 IAMDLGELCEDTMTYKCPRITETEPDDVDCWCNATETWVTYGTCSQTGEH
MAMDLGELCEDTITYKCPFLRQNEPEDIDCWCNSTSTWVTYGTCTTTGEH
IAMDLGEMCDDTVTYKCPHITEVEPEDIDCWCNLTSTWVTYGTCNQAGEH 200
RRDKRSVALAPHVGMGLDTRTQTWMSAEGAWRQVEKVETWALRHPGFTIL
AAILAYTIGTTHFQRALIFILLTAVAPSMTMRCIGISNRDFVEGVSGGSW 300




AT.PT.AHYTnT^T.TOKVVTFTTiTiMTjVTP^MTMRCVGVGNRDFVEGLSGATW 


300 








* * ** * _ *.*. *.** ***,*. ********* # * 




DEN4 300 VDLVLEHGGCVTTMAQGKPTLDFELTKTTAKEVALLRTYCIEASISNITT
VDVVLEHGSCVTTMAKDKPTLDIELLKTEVTNPAVLRKLCIEAKISNTTT
VDIVLEHGSCVTTMAKNKPTLDFELIKTEAKQPATLRKYCIEAKLTNTTT
VDWLEHGGCVTTMAKNKPTLDIELQKTEATQLATLRKLCIEGKITNITT

ATRCPTQGEPYLKEEQDQQYICRRDWDRGWGNGCGLFGKGGWTCAKFS
DSRCPTQGEATLVEEQDTNFVCRRTFVDRGWGNGCGLFGKGSLITCAKFK
DSRCPTQGEPSLNEEQDKRFVCKHSMVDRGWGNGCGLFGKGGIVTCAMFT
CSGKITGNLVQIENLEYTVVVTVHNGDTHAVGNDTSNHGVTAMITPRSPS 449

CVTKLEGKIVQYENLKYSVIVTVHTGDQHQVGNETTEHGTTATITPQAPT 450

CKKNMKGKWQPENLEYTIVITPHSGEEHAVGNDTGKHGKEIKITPQSSI 450

CLESIEGKWQHENLKYTVIITVHTGDQHQVGNET--QGVTAEITSQAST 448
* t * # ** **★ *....* * * . * * ** . * .* ** . . 

VEVKLPDYGELTLDCEPRSGIDFNEMILMKMKKKTWLVHKQWFLDLPLPW 499

SEIQLTDYGALTLDCSPRTGLDFNEMVLLTMEKKSWLVHKQWFLDLPLPW 500

TEAELTGYGTVTMECSPRTGLDFNEMVLLQMENKAWLVHRQWFLDLPLPW 500

AEAILPEYGTLGLECSPRTGLDFNEMILLTMKNKAWMVHRQWFFDLPLPW 498
* * ** #> * ** m * m ***** m * m * ******* ****** 

TAGADTSEVHWNYKERIWTFKVPHAKRQDVTVLGSQEGAMHSALAGATEV 549

TSGASTSQETWNRQDLLVTFKTAHAKKQEVWLGSQEGAMHTALTGATEI 550

LPGADTQGSNWIQKETLVTFKNPHAKKQDVWLGSQEGAMHTALTGATEI 550

TSGATTKTPTWNRKELLVTFKNAHAKKQEVWLGSQEGAMHTALTGATEI 548
** * * **** ***** ********** t ****** . 

DSGDGNHMFAGHLKCKVRMEKLRIKGMSYTMCSGKFSIDKEMAETQHGTT 599

QTSGTTTIFAGHLKCRLKMDKLTLKGMSYVMCTGSFKLEKEVAETQHGTV 600

QMSSGNLLFTGHLKCRLRMDKLQLKGMSYSMCTGKFKWKEIAETQHGTI 600

QTSGGTSIFAGHLKCRLKMDKLKLKGMSYAMCLNTFVLKKEVSETQHGTI 598
a * a ***** m m m * m ** ,★**** ** m * ** t ****** 

WKVKYEGAGAPCKVPIEIRDWKEKWGRIISSTPLAENTNSVTNIELE 649

LVQVKYEGTDAPCKIPFSSQDEKGVTQNGRLITANPIVTDKEKPVNIEAE 650

VIRVQYEGDGSPCKIPFEIMDLEKRHVLGRLITVNPIVTEKDSPVNIEAE 650

LIKVEYKGEDAPCKIPFSTEDGQGKAHNGRLITANPWTKKEEPVNIEAE 648



PPFGDSYIVIGVGNSALTLHWFRKGSSIGKMFESTYRGAKRMAILGETAW 699 

PPFGESYIWGAGEKALKLSWFKKGSSIGKMFEATARGARRMAILGDTAW 700 

PPFGDSYIIIGVEPGQLKLNWFKKGSSIGQMIETTMRGAKRMAILGDTAW 700 

PPFGESNIVIGIGDKALKINWYRKGSSIGKMFEATARGARRMAILGDTAW 698 
***** *..* * . *..*****★.* *,* *** . ****** m ** * 

DFGSVGGLFTSLGKAVHQVFGSVYTTMFGGVSWMIRILIGFLVLWIGTNS 749 

DFGSIGGVFTSVGKLIHQIFGTAYGVLFSGVSWTMKIGIGILLTWLGLNS 750 

DFGSLGGVFTSIGKALHQVFGAIYGAAFSGVSWTMKILIGVIITWIGMNS 750 

DFGSVGGVLNSLGKMVHQIFGSAYTALFSGVSWIMKIGIGVLLTWIGLNS 748
****** 4 *** > **** i * * **** 4> * ** ma * 4 * ** 

RNTSMAMTCIAVGGITLFLGFTVQADMGCVASWSGKELKCGSGIFVVDNV 799 

RSTSLSMTCIAVGMVTLYLGVMVQADSGCVINWKGRELKCGSGIFVTNEV 800 

RSTSLSVSLVLVGWTLYLGVMVQADSGCWSWKNKELKCGSGIFITDNV 800 

KNTSMSFSCIAIGIITLYLGVWQADMGCVINWKGKELKCGSGIFVTNEV 798 

** ★ ** ** **** *** * ^********* # * 

HTWTEQYKFQPESPARLASAILNAHKDGVCGIRSTTRLENVMWKQITNEL 849

HTWTEQYKFQADSPKRLSAAIGKAWEEGVCGIRSATRLENIMWKQISNEL 850

HTWTEQYKFQPESPSKLASAIQKAHEEGICGIRSVTRLENLMWKQITPEL 850

HTWTEQYKFQADSPKRVATAIAGAWENGVCGIRSTTRMENLLWKQIANEL 848



********** t ** ....** * * # ***** ** ** e# **** # ** 
************ m ****** * ,** * m * m * * > m m ** m 

DEN4 1150 RRRVTRKHMILVWITLCAIIIiGGLTW^4DLLRALIMLGDTMSGRIG-GQI 1198 

DEN1-WP 1151 RSRWSRKMLMTGTLAVFLLLTMGQLTWNDLIRLCIMVGANASDKMGMGTT 1200

DEN2-NGC 1151 RTRVGTKHAILLVAVSFVTLITGNMSFRDLGRVMVMVGATMTDDIGMGVT 1200

DEN3-H87 1149 RGKFGKKHMIAGVLFTFVLLLSGQITWRGMAHTLIMIGSNASDRMGMGVT 1198 
* . * . . *.. .*.*.. . * * 

DEN4 1199 HLAIMAVFKMSPGYVLGVFLRKLTSRETALMVIGMAMTTVLSIPHDLMEL 1248

DEN1-WP 1201 YLALMATFRMRPMFAVGLLFRRLTSREVLLLTVGLSLVASVELPNSLEEL 1250 

DEN2-NGC 1201 YLALLAAFKVRPTFAAGLLLRKLTSKELMMTTIGIVLLSQSTIPETILEL 1250 

DEN3-H87 1199 YLALIATFKIQPFLALGFFLRKLTSRENLLLGVGLAMAATLRLPEDIEQM 1248 
* * * * * * ***** * * 



DEN4 1249 IDGISLGLILLKIVTQFDNTQVGTLALSLTFIRSTMPLVMAWRTIMAVLF 1298

DEN1-WP 1251 GDGLAMGIMMLKLLTDFQSHQLWATLLSLTFVKTTFSLHYAWKTMAMILS 1300
DEN2-NGC 1251 TDALALGMMVLKMVRKMEKYQLAVTIMAILCVPNAVILQNAWKVSCTILA 1300
DEN3-H87 1249 ANGIALGLMALKLITQFETYQLWTALVSLTCSNTIFTLTVAWRTATLILA 1298 

...*..**.. * . ... * ** . . * 
DEN4 1299 WTLIPLCRTSCLQKQSHWVEITALILGAQALPVYLMTLMKGASRRSWPL 1348

DEN1-WP 1301 IVSLFPLCLSTTSQK-TTWLPVLLGSLGCKPLTMFLITENKIWGRKSWPL 1349

DEN2-NGC 1301 WSVSPLFLTSSQQK-ADWIPLALTIKGLNPTAIFLTTLSRTNKKRSWPL 1349

DEN3-H87 1299 GISLLPVCQSSSMRK-TDWLPMTVAAMGVPPLPLFIFSLKDTLKRRSWPL 1347 



DEN4 1349 NEGIMAVGLVSLLGSALLKNDVPLAGPMVAGGLLLAAYVMSGSSADLSLE 1398

DEN1-WP 1350 NEGIMAVGIVSILLSSLLKNDVPLAGPLIAGGMLIACYVISGSSADLSLE 1399

DEN2-NGC 1350 NEAIMAVGMVSILASSLLKNDIPMTGPLVAGGLLTVCYVLTGRSADLELE 1399

DEN3-H87 1348 NEGVMAVGLVSILASSLLRNDVPMAGPLVAGGLLIACYVITGTSADLTVE 1397



**** ********* 



* **★* 



DEN4 1399 KAANVQWDEMADITGSSPIIEVKQDEDGSFSIRDVEETNMITLLVKLALI 1448

DEN1-WP 1400 KAAEVSWEEEAEHSGASHNILVEVQDDGTMKIKDEERDDTLTILLKATLL 1449

DEN2-NGC 1400 RAADVKWEDQAEISGSSPILSITISEDGSMSIKNEEEEQTLTILIRTGLL 1449

DEN3-H87 1398 KAADVTWEEEAEQTGVSHNLMITVDDDGTMRIKDDETENILTVLLKTALL 1447 



DEN4 1449 TVSGLYPLAIPVTMTLWYMWQVKTQRSGALWDVPSPAATKKAALSEGVYR 1498

DEN1-WP 1450 AISGVYPMSIPATLFVWYFWQKKKQRSGVLWDTPSPPEVERAVLDDGIYR 1499
DEN2-NGC 1450 VISGLFPVSIPITAAAWYLWEVKKQRAGVLWDVPSPPPVGKAELEDGAYR 1499
DEN3-H87 1448 IVSGIFPYSIPATMLVWHTWQKQTQRSGVLWDVPSPPETQKAELEEGVYR 1497 
.**..* .** * * *. **.* *** *** t * * ,* ** 

DEN4 1499 IMQRGLFGKTQVGVGIHMEGVFHTMWHVTRGSVICHETGRLEPSWADVRN 1548

DEN1-WP 1500 ILQRGLLGRSQVGVGVFQEGVFHTMWHVTRGAVLMYQGKRLEPSWASVKK 1549
DEN2-NGC 1500 IKQKGILGYSQIGAGVYKEGTFHTMWHVTRGAVLMHKGKRIEPSWADVKK 1549
DEN3-H87 1498 IKQQGIFGKTQVGVGVQKEGVFHTMWHVTRGAVLTHNGKRLEPNWASVKK 1547
* *.*. * .*.* *. ** **********.*. *^** *★ * t 

DEN4 1549 DMISYGGGWRLGDKWDKEEDVQVLAIEPGKNPKHVQTKPGLFKTLTGEIG 1598

DEN1-WP 1550 DLISYGGGWRFQGSWNAGEEVQVIAVEPGKNPKNVQTAPGTFKTPEGEVG 1599

DEN2-NGC 1550 DLISYGGGWKLEGEWKEGEEVQVLALEPGKNPRAVQTKPGLFKTNAGTIG 1599

DEN3-H87 1548 DLISYGGGWRLSAQWQKGEEVQVIAVEPGKNPKNFQTMPGIFQTTTGEIG 1597 

* * m *** m * t ****** m ** ** * # * * t * 

DEN4 1599 AVTLDFKPGTSGSPIINRKGKVIGLYGNGWTKSGDYVSAITQAERIGEP 1648 

DEN1-WP 1600 AIALDFKPGTSGSPIVNREGKIVGLYGNGWTTSGTYVSAIAQAKASQEG 1649 
DEN2-NGC 1600 AVSLDFSPGTSGSPIIDKKGKWGLYGNGWTRSGAYVSAIAQTEKSIED 1649
DEN3-H87 1598 AIALDFKPGTSGSPIINREGKWGLYGNGWTKNGGYVSGIAQTNAEPDG 1647 
* tm *** ******** t m ** m ********* * *** *.*. 

DEN4 1649 -DYEVDEDIFRKKRLTIMDLHPGAGKTKRILPSIVREALKRRLRTLILAP 1697

DEN1-WP 1650 PLPEIEDEVFRKRNLTIMDLHPGSGKTRRYLPAIVREAIRRNVRTLVLAP 1699
DEN2-NGC 1650 -NPEIEDDIFRKRKLTIMDLHPGAGKTKRYLPAIVREAIKRGLRTLILAP 1698
DEN3-H87 1648 PTPELEEEMFKKRNLTIMDLHPGSGKTRKYLPAIVREAIKRRLRTLILAP 1697

*.....*.*. *********.***.. ** m ***** ^ * ****** 

DEN4 1698 TRWAAEMEEALRGLPIRYQTPAVKSEHTGREIVDLMCHATFTTRLLSST 1747 

DEN1-WP 1700 TRWASEMAEALKGMPIRYQTTAVKSEHTGKEIVDLMCHATFTMRLLSPV 1749 
DEN2-NGC 1699 TRWAAEMEEALRGLPIRYQTPAIRAEHTGREIVDLMCHATFTMRLLSPV 1748
DEN3-H87 1698 TRWAAEMEEAMKGLPIRYQTTATKSEHTGREIVDLMCHATFTMRLLSPV 1747
***** ** ** * ****** * **** ************ **** 
DEN4 1748 RVPNYNLIVMDEAHFTDPSSVAARGYISTRVEMGEAAAIFMTATPPGATD 1797

DEN1-WP 1750 RVPNYNMIIMDEAHFTDPASIAARGYISTRVGMGEAAAIFMTATPPGSVE 1799 

DEN2-NGC 1749 RVPNYNLIIMDEAHFTDPASIAARGYISTRVEMGEAAGIFMTATPPGSRD 1798

DEN3-H87 1748 RVPNYNLIIMDEAHFTDPASIAARGYISTRVGMGEAAAIFMTATPPGTAD 1797 
****** m * m ********* t * t ********** ***** *********^ 

DEN4 1798 PFPQSNSPIEDIEREIPERSWNTGFDWITDYQGKTVWFVPSIKAGNDIAN 1847 

DEN1-WP 1800 AFPQSNAVIQDEERDIPERSWNSGYDWITDFPGKTVWFVPSIKSGNDIAN 1849
DEN2-NGC 1799 PFPQSNAPIMDEEREIPERSWSSGHEWVTDFKGKTVWFVPSIKAGNDIAA 1848
DEN3-H87 1798 AFPQSNAPIQDEERDIPERSWNSGNEWITDFVGKTVWFVPSIKAGNVIAN 1847
***** t * * ** # ****** # * i * > ** < *********** m ** ** 

DEN4 1848 CLRKSGKKVIQLSRKTFDTEYPKTKLTDWDFWTTDISEMGANFRAGRVI 1897 

DEN1-WP 1850 CLRKNGKRWQLSRKTFDTEYQKTKNNDWDYVVTTDISEMGANFRADRVI 1899

DEN2-NGC 1849 CLRKNGKKVIQLSRKTFDSEYVKTRTNDWDFVVTTDISEMGANFKAERVI 1898

DEN3-H87 1848 CLRKNGKKVIQLSRKTFDTEYQKTKLNDWDFWTTDISEMGANFIADRVI 1897
**** ************* ** # **************** * *** 

DEN4 1898 DPRRCLKPVILPDGPERVILAGPIPVTPASAAQRRGRIGRNPAQEDDQYV 1947 

DEN1-WP 1900 DPRRCLKPVILKDGPERVILAGPMPVTVASAAQRRGRIGRNQNKEGDQYI 1949 

DEN2-NGC 1899 DPRRCMKPVILTDGEERVILAGPMPVTHSSAAQRRGRIGRNPKNENDQYI 1948 

DEN3-H87 1898 DPRRCLKPVILTDGPERVILAGPMPVTVASAAQRRGRVGRNPQKENDQYI 1947 
*****^***** ** ********,*** m ******** t *** * ***. 

DEN4 1948 FSGDPLKNDEDHAHWTEAKMLLDNIYTPEGIIPTLFGPEREKTQAIDGEF 1997 

DEN1-WP 1950 YMGQPLNNDEDHAHWTEAKMLLDNINTPEGIIPALFEPEREKSAAIDGEY 1999 
DEN2-NGC 1949 YMGEPLENDEDCAHWKEAKMLLDNINTPEGIIPSMFEPEREKVDAIDGEY 1998 
DEN3-H87 1948 FMGQPLNKDEDHAHWTEAKMLLDNINTPEGIIPALFEPEREKSAAIDGEY 1997
* ** *** *** ********* ******* ## * ***** *****^ 

DEN4 1998 RLRGEQRKTFVELMRRGDLPVWLSYKVASAGISYKDREWCFTGERNNQIL 2047

DEN1-WP 2000 RLRGEARKTFVELMRRGDLPVWLSYKVASEGFQYSDRRWCFDGERNNQVL 2049
DEN2-NGC 1999 RLRGEARKTFVDLMRRGDLPVWLAYRVAAEGINYADRRWCFDGIKNNQIL 2048
DEN3-H87 1998 RLKGESRKTFVELMRRGDLPVWLAHKVASEGIKYTDRKWCFDGERNNQIL 2047 
**** ***** m *********** m .**. * * ** *** * .***.* 

DEN4 2048 EENMEVEIWTREGEKKKLRPRWLDARVYADPMALKDFKEFASGRKSITLD 2097

DEN1-WP 2050 EENMDVEIWTKEGERKKLRPRWLDARTYSDPLALREFKEFAAGRRSVSGD 2099 
DEN2-NGC 2049 EENVEVEIWTKEGERKKLKPRWLDARIYSDPLTLKEFKEFAAGRKSLTLN 2098
DEN3-H87 2048 EENMDVEIWTKEGEKKKLRPRWLDARTYSDPLALKEFKDFAAGRKSIALD 2097 
*** > ***** *** *** ******* * # ** # ^ * ^ < ** ** > ** > * > a 

DEN4 2098 ILTEIASLPTYLSSRAKLALDNIVMLHTTERGGRAYQHALNELPESLETL 2147

DEN1-WP 2100 LILEIGKLPQHLTQRAQNALDNLVMLHNSEQGGKAYRHAMEELPDTIETL 2149

DEN2-NGC 2099 LITEMGRLPTFMTQKARDALDNLAVLHTAEAGGRAYNHALSELPETLETL 2148

DEN3-H87 2098 LVTEIGRVPSHLAHRTRNALDNLVMLHTSEHGGRAYRHAVEELPETMETL 2147

* # * ****. .**..* **.*★ **. ***...*** 

DEN4 2148 MLVALLGAMTAGIFLFFMQGKGIGKLSMGLITIAVASGLLWVAEIQPQWI 2197

DEN1-WP 2150 MLLALIAVLTGGVTLFFLSGRGLGKTSIGLLCVIASSALLWMASVEPHWI 2199

DEN2-NGC 2149 LLLTLLATVTGGIFLFLMSGRGIGKMTLGMCCIITASILLWYAQIQPHWI 2198

DEN3-H87 2148 LLLGLMILLTGGAMLFLISGKGIGKTSIGLICVIASSGMLWMADVPLQWI 2197
*★ **** **** * , * ** * . .** 
DEN4 2198 AASIILEFFLMVLLIPEPEKQRTPQDNQLIYVILTILTIIGLIAANEMGL 2247

DEN1-WP 2200 AASIILEFFLMVLLIPEPDRQRTPQDNQLAYWIGLLFMILTAAANEMGL 2249
DEN2-NGC 2199 AASIILEFFLIVLLIPEPEKQRTPQDNQLTYWIAILTWAATMANEMGF 2248 
DEN3-H87 2198 ASAIVLEFFMMVLLIPEPEKQRTPQDNQLAYVVIGILTIjAAIVAANEMGL 2247 
* t % * m **** ^ ******* m # ********* **.. .* . ***** 

DEN4 2248 IEKTKTDFGFY QVKTETTILDVDLRPASAWTLYAVATTILTPMLRH 2293 

DEN1-WP 2250 LETTKKDLGIGHAAAENHHHAAMLDVDLHPASAWTLYAVATTIITPMMRH 2299 
DEN2-NGC 2249 LEKTKKDLGLG-SITTQQPESNILDIDLRPASAWTLYAVATTFVTPMLRH 2297 
DEN3-H87 2248 LETTKRDLGMS-KEPGWSPTSYLDVDLHPASAWTLYAVATTVITPMLRH 2296 
***** m ***************** *★*.** 

DEN4 2294 TIENTSANLSLAAIANQAAVLMGLGKGWPLHRMDLGVPLLAMGCYSQVNP 2343 

DEN1-WP 2300 TIENTTANISLTAIANQAAILMGLDKGWPISKMDIGVPLLALGCYSQVNP 2349
DEN2-NGC 2298 SIENSSVNVSLTAIANQATVLMGLGKGWPLSKMDIGVPLLAIGCYSQVNP 2347
DEN3-H87 2297 TIENSTANVSLAAIANQAWLMGLDKGWPISKMDLGVPLLALGCYSQVNP 2346 
.***.. *.**.****** ,**** ***★, ********.******** 

DEN4 2344 TTLTASLVMLLVHYAIIGPGLQAKATREAQKRTAAGIMKNPTVDGITVID 2393 

DEN1-WP 2350 LTLTAAVFMLVAHYAIIGPGLQAKATREAQKRTAAGIMKNPTVDGIVAID 2399 
DEN2-NGC 2348 ITLTAALFLLVAHYAIIGPGLQAKATREAQKRAAAGIMKNPTVDGITVID 2397
DEN3-H87 2347 LTLIAAVLLLVTHYAIIGPGLQAKATREAQKRTAAGIMKNPTVDGIMTID 2396 
** * #< * m ******************** m ************* ** 

DEN4 2394 LEPISYDPKFEKQLGQVMLLVLCAGQLLLMRTTWAFCEVLTLATGPILTL 2443 

DEN1-WP 2400 LDPWYDAKFEKQLGQIMLLILCTSQILLMRTTWALCESITLATGPLTTL 2449

DEN2-NGC 2398 LDPIPYDPKFEKQLGQVMLLVLCVTQVLMMRTTWALCEALTLATGPISTL 2447

DEN3-H87 2397 LDPVIYDSKFEKQLGQVMLLVLCAVQLLLMRTSWALCEVLTLATGPITTL 2446 
*.*. ** *★★*****,★**.** * # * # ***.** ** # ******^ ** 

DEN4 2444 WEGNPGRFWNTTIAVSTANIFRGSYLAGAGLAFSLIKNAQTPRRGTGTTG 2493

DEN1-WP 2450 WEGSPGKFWNTTIAVSMANIFRGSYLAGAGLAFSLMKSLGGGRRGTGAQG 2499 

DEN2-NGC 2448 WEGNPGRFWNTTIAVSMANIFRGSYLAGAGLLFSIMKNTTNTRRGTGNIG 2497 

DEN3-H87 2447 WEGSPGKFWNTTIAVSMANIFRGSYLAGAGLALSIMKSVGTGKRGTGSQG 2496 
*** *********** ************** *..* ,***★ * 

DEN4 2494 ETLGEKWKRQLNSLDRKEFEEYKRSGILEVDRTEAKSALKDGSKIKHAVS 2543

DEN1-WP 2500 ETLGEKWKRQLNQLSKSEFNTYKRSGIIEVDRSEAKEGLKRGEPTKHAVS 2549 

DEN2-NGC 2498 ETLGEKWKSRLNALGKSEFQIYKKSGIQEVDRTLAKEGIKRGETDHHAVS 2547 

DEN3-H87 2497 ETLGEKWKKKLNQLSRKEFDLYKKSGITEVDRTEAKEGLKRGEITHHAVS 2546 
******** # ** * m ** **.*** **** < ** p * * **** 

DEN4 2544 RGSSKIRWIVERGMVKPKGKWDLGCGRGGWSYYMATLKNVTEVKGYTKG 2593 

DEN1-WP 2550 RGTAKLRWFVERNLVKPEGKVIDLGCGRGGWSYYCAGLKKVTEVKGYTKG 2599 
DEN2-NGC 2548 RGSAKLRWFVERNMVTPEGKVVDLGCGRGGWSYYCGGLKNVREVKGLTKG 2597 
DEN3-H87 2547 RGSAKLQWFVERNMVIPEGRVIDLGCGRGGWSYYCAGLKKVTEVRGYTKG 2596 
** t t * 9 t * *** a # * * * ************* ** * **,* *** 

DEN4 2594 GPGHEEPIPMATYGWNLVKLHSGVDVFYKPTEQVDTLLCDIGESSSNPTI 2643

DEN1-WP 2600 GPGHEEPIPMATYGWNLVKLYSGKDVFFTPPEKCDTLLCDIGESSPNPTI 2649
DEN2-NGC 2598 GPGHEEPIPMSTYGWNLVRLQSGVDVFFTPPEKCDTLLCDIGESSPNPTV 2647 
DEN3-H87 2597 GPGHEEPVPMSTYGWNIVKLMSGKDVFYLPPEKCDTLLCDIGESSPSPTV 2646 
******* ** ***** * * ** *** * * *********** **. 
DEN4 2644 EEGRTLRVLKMVEPWLSSKPEFCIKVLNPYMPTVIEELEKLQRKHGGNLV 2693

DEN1-WP 2650 EEGRTLRVLKMVEPWLRGN-QFCIKILNPYMPSVVETLEQMQRKHGGMLV 2698

DEN2-NGC 2648 EAGRTLRVLNLVENWLNNNTQFCIKVLNPYMPSVIEKMEALQRKYGGALV 2697

DEN3-H87 2647 EESRTIRVLKMVEPWLKNN-QFCIKVLNPYMPTVIEHLERLQRKHGGMLV 2695

* **^*** ^** ** **********.*.* .* *** ** ** 

DEN4 2694 RCPLSRNSTHEMYWVSGASGNIVSSVNTTSKMLLNRFTTRHRKPTYEKDV 2743

DEN1-WP 2699 RNPLSRNSTHEMYWVSCGTGNIVSAVNMTSRMLLNRFTMAHRKPTYERDV 2748 

DEN2-NGC 2698 RNPLSRNSTHEMYWVSNASGNIVSSVNMISRMLINRFTMRHKKATYEPDV 2747 

DEN3-H87 2696 RNPLSRNSTHEMYWISNGTGNIVSSVNMVSRLLLNRFTMTHRRPTIEKDV 2745 

* ************.* ******* * # * **** *_ * * ** 

DEN4 2744 DLGAGTRSVSTETEKPDMTIIGRRLQRLQEEHKETWHYDQENPYRTWAYH 2793 

DEN1-WP 2749 DLGAGTRHVAVEPEVANLDIIGQRIENIKNGHKSTWHYDEDNPYKTWAYH 2798

DEN2-NGC 2748 DLGSGTRNIGIESEIPNLDIIGKRIEKIKQEHETSWHYDQDHPYKTWAYH 2797

DEN3-H87 2746 DLGAGTRHVNAEPETPNMDVIGERIKRIKEEHSSTWHYDDENPYKTWAYH 2795
****** * * . .** * # * .**** _**.**★** 

DEN4 2794 GSYEAPSTGSASSMVNGWKLLTKPWDVIPMVTQLAMTDTTPFGQQRVFK 2843

DEN1-WP 2799 GSYEVKPSGSASSMVNGWRLLTKPWDVIPMVTQIAMTDTTPFGQQRVFK 2848

DEN2-NGC 2798 GSYETKQTGSASSMVNGWRLLTKPWDWPMVTQMAMTDTTPFGQQRVFK 2847

DEN3-H87 2796 GSYEVKATGSASSMINGWKLLTKPWDWPMVTQMAMTDTTPFGQQRVFK 2845 
**** ****** **** ^ ******** *****. *************** 

DEN4 2844 EKVDTRTPQPKPGTRMVMTTTANWLWALLGKKKNPRLCTREEFISKVRSN 2893

DEN1-WP 2849 EKVDTRTPKAKRGTAQIMEVTARWLWGFLSRNKKPRICTREEFTRKVRSN 2898

DEN2-NGC 2848 EKVDTRTQEPKEGTKKLMKITAEWLWKELGKKKTPRMCTREEFTRKVRSN 2897

DEN3-H87 2846 EKVDTRTPRPMPGTRKVMEITAEWLWRTLGRNKRPRLCTREEFTKKVRTN 2895 
******* ** .* ** *** * , * ******** ***.* 

DEN4 2894 AAIGAVFQEEQGWTSASEAVNDSRFWELVDKERALHQEGKCESCVYNMMG 2943

DEN1-WP 2899 AAIGAVFVDENQWNSAKEAVEDERFWDLVHRERELHKQGKCATCVYNMMG 2948

DEN2-NGC 2898 AALGAIFTDENKWKSAREAVEDSRFWELVDKERNLHLEGKCETCVYNMMG 2947

DEN3-H87 2896 AAMGAVFTEENQWDSARAAVEDEEFWKLVDRERELHKLGKCGSCVYNMMG 2945
***** * ** ** * ** ** .** ** *** ******* 

DEN4 2944 KREKKLGEFGRAKGSRAIWYMWLGARFLEFEALGFLNEDHWFGRENSWSG 2993 

DEN1-WP 2949 KREKKLGEFGKAKGSRAIWYMWLGARFLEFEALGFMNEDHWFSRENSLSG 2998 

DEN2-NGC 2948 KREKKLGEFGKAKGSRAIWYMWLGARFLEFEALGFLNEDHWFSRENSLSG 2997

DEN3-H87 2946 KREKKLGEFGKAKGSRAIWYMWLGARYLEFEALGFLNEDHWFSRENSYSG 2995 
********** m ***************.********. ****** **** ** 

DEN4 2994 VEGEGLHRLGYILEEIDKKDGDLMYADDTAGWDTRITEDDLQNEELITEQ 3043

DEN1-WP 2999 VEGEGLHKLGYILRDISKIPGGNMYADDTAGWDTRITEDDLQNEAKITDI 3048

DEN2-NGC 2998 VEGEGLHKLGYILRDVSKKEGGAMYADDTAGWDTRITLEDLKNEEMVTNH 3047

DEN3-H87 2996 VEGEGLHKLGYILRDISKIPGGAMYADDTAGWDTRITEDDLHNEEKITQQ 3045
******* # **** * * * ************** **** t * 

DEN4 3044 MAPHHKILAKAIFKLTYQNKWKVLRPTPRGAVMDIISRKDQRGSGQVGT 3093

DEN1-WP 3049 MEPEHALLATSIFKLTYQNKVVRVQRPAKNGTVMDVISRRDQRGSGQVGT 3098

DEN2-NGC 3048 MEGEHKKLAEAIFKLTYQNKWRVQRPTPRGTVMDIISRRDQRGSGQVGT 3097

DEN3-H87 3046 MDPEHRQLANAIFKLTYQNKWKVQRPTPKGTVMDIISRKDQRGSGQVGT 3095 

* * ** *********** * ** * **************** 
DEN4 3094 YGLNTFTNMEVQLIRQMEAEGVITQDDMQNPKGLKERVEKWLKECGVDRL 3143

DEN1-WP 3099 YGLNTFTNMEAQLIRQMESEGIFSPSELETPN-LAERVLDWLKKHGTERL 3147

DEN2-NGC 3098 YGLNTFTNMEAQLIRQMEGEGVFKSIQHLTOT-EEIAVQNWLARVGRERL 3146

DEN3-H87 3096 YGLNTFTNMEAQLIRQMEGEGVLSKADLENPHPLEKKITQWLETKGVERL 3145

********** ******* ** m it* * ** 

DEN4 3144 KRMAISGDDCWKPLDERFGTSLLFLNDMGKVRKDIPQWEPSKGWKNWQE 3193 

DEN1-WP 3148 KRMAISGDDCWKPIDDRFATALTALNDMGKVRKDIPQWEPSKGWNDWQQ 3197
DEN2-NGC 3147 SRMAISGDDCWKPLDDRFASALTALNDMGKVRKDIQQWEPSRGWNDWTQ 3196
DEN3-H87 3146 KRMAISGDDCWKPIDDRFANALLALNDMGKVRKDIPQWQPSKGWHDWQQ 3195
************* m *. ** .* *********** **.**.*★ * 

DEN4 3194 VPFCSHHFHKIFMKDGRSLWPCRNQDELIGRARISQGAGWSLRETACLG 3243 

DEN1-WP 3198 VPFCSHHFHQLIMKDGREIWPCRNQDELVGRARVSQGAGWSLRETACLG 3247 

DEN2-NGC 3197 VPFCSHHFHELIMKDGRVLWPCRNQDELIGRARISQGAGWSLRETACLG 3246 

DEN3-H87 3196 VPFCSHHFHELIMKDGRKLWPCRPQDELIGRARISQGAGWSLRETACLG 3245 
********* t ***** ***** *********************** 

DEN4 3244 KAYAQMWSLMYFHRRDLRLASMAICSAVPTEWFPTSRTTWSIHAHHQWMT 3293

DEN1-WP 3248 KSYAQMWQLMYFHRRDLRLAANAICSAVPVDWVPTSRTTWSIHAHHQWMT 3297
DEN2-NGC 3247 KSYAQMWSLMYFHRRDLRLAANAICSAVPSHWVPTSRTTWSIHAKHEWMT 3296
DEN3-H87 3246 KAYAQMWTLMYFHRRDLRLASNAICSAVPVHWVPTSRTTWSIHAHHQWMT 3295
*^***** ************ # ******* * *********** _ * m *** 

DEN4 3294 TEDMLKVWNRWIEDNPNMTDKTPVHSWEDIPYLGKREDLWCGSLIGLSS 3343

DEN1-WP 3298 TEDMLSVWNRVWIEENPWMEDKTHVSSWEDVPYLGKREDRWCGSLIGLTA 3347
DEN2-NGC 3297 TEDMLTVWNRVWIQENPWMEDKTPVESWEEIPYLGKREDQWCGSLIGLTS 3346
DEN3-H87 3296 TEDMLTVWNRVWIEDNPWMEDKTPVTTWEDVPYLGKREDQWCGSLIGLTS 3345 
***** *******,** * *** * m ** . . ******** ★******★.. 

DEN4 3344 RATWAKNIHTAITQVRNLIGKEEYVDYMPVMKRYSAPSESEGVL 3387

DEN1-WP 3348 RATWATNIQVAINQVRRLIGNENYLDFMTSMKRFKNESDPEGALW 3392

DEN2-NGC 3347 RATWAKNIQTAINQVRSLIGNEEYTDYMPSMKRFRREEEEAGVLW 3391

DEN3-H87 3346 RATWAQNILTAIQQVRSLIGNEEFLDYMPSMKRFRKEEESEGAIW 3390 
***** ** ** *** *** * a *.* *** > * 

* Residue identity 
. Residue similarity 
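
As a rough illustration of how the identity marks in the legend relate to the aligned rows, the following Python sketch places a `*` under every column in which all rows carry the same residue. The function name `identity_marks` is hypothetical, and the two example rows are simply the first ten residues of the DEN1-WP and DEN2-NGC rows at the start of this appendix. The appendix does not say which program produced the alignment or which residue groupings count as similar, so only identity is computed here.

```python
def identity_marks(rows):
    """Return a string with '*' under every column where all rows agree.

    Columns that contain a gap ('-') or differing residues get a space.
    Illustrative helper only; it is not taken from the source document.
    """
    if not rows:
        return ""
    # Only compare columns that exist in every row.
    width = min(len(row) for row in rows)
    marks = []
    for i in range(width):
        column = {row[i] for row in rows}
        is_identical = len(column) == 1 and "-" not in column
        marks.append("*" if is_identical else " ")
    return "".join(marks)


# First ten residues of two rows from the first alignment block.
den1 = "MNNQRKKTGR"
den2 = "MNNQRKKARN"
print(den1)
print(den2)
print(identity_marks([den1, den2]))
```

Marking similarity with `.` would additionally require a table of residue groups, which would have to be supplied as a separate assumption.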

[0238] While the present invention has been described in some detail for purposes 
of clarity and understanding, one skilled in the art will appreciate that various changes in 
form and detail can be made without departing from the true scope of the invention. All 
figures, tables, and appendices, as well as patents, applications, and publications, referred to 
above, are hereby incorporated by reference. 